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[57] ABSTRACT 
Compounds having the structure: 
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wherein B represents a purine; 7-deazap urine, or py- 
rimidine moiety covalently bonded to the C 1 '-posi- 
tion of the sugar moiety, provided that when B is 
purine or 7-deazapurine, it is attached at the Im- 
position of the purine or 7-deazapurine and when B 
is pyrimidine, it is attached at the NLposition; 
wherein A represents a moiety consisting of at least 
three carbon atoms which is capable of forming a 
detectable complex with a polypeptide when the 
compound is incorporated into a double-stranded 
ribonucleic acid, deoxyribonucleic acid duplex, or 
DNA-RNA hybrid; 
wherein the dotted line represents a chemical linkage 
joining B and A, provided that if B is purine, the 
linkage is attached to the 8-position of the purine, if 
B is 7-deazapurine, the linkage is attached to the 
7-position of the deazapurine, and if B is pyrimi- 
dine, the linkage is attached to the 5-position of the 
pyrimidine; and 
wherein each of x, y and z represents 
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either directly, or when incorporated into oligo- 
and polynucleotides, provide probes which are 
widely useful. 

Applications include detection and localization of poly- 
nucleotide sequences in chromosomes, fixed ceils, tissue 
sections, and cell extracts. Specific applications include 
chromosomal karyotyping, clinical diagnosis of nucleic 
acid -containing etiological agents, e.g. bacteria, viruses, 
or fungi, and diagnosis of genetic disorders. 
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METHODS OF USING LABELED NUCLEOTIDES 

The invention disclosed and/or claimed in this appli- 
cation was made in the course of work carried out 
under grants from the Public Health Service, National 
Institute of General Medical Science Research Grant 
P50-GM20124; Public Health Service, National Cancer 
Institute Training Grant T32-CA09159; and Public 
Health Service, National Institute of General Medical 
Science Grant T32-GM07499. "The Government has 
certain rights in the invention. 

This application is a divisional of U.S. patent applica- 
tion Ser. No. 06/496,915, filed on May 23, 1983, now 
U.S. Pat. No. 4.711,955, issued Dec. 8, 1987 which ap- 
plication was a continuation of U.S. application Ser. 
No. 06/255,223, filed on Apr. 17, 1981, abandoned. 

BACKGROUND OF THE INVENTION 
Many procedures employed in biomedical research 
and recombinant DNA technology rely heavily on the 
use of nucleotide or polynucleotide derivatives radioac- 
tively labeled with isotopes of hydrogen ( 3 H), phospho- 
rus ( 32 P), carbon ( 14 C), or iodine ( 12 -I). Such radioac- 
tive compounds provide useful indicator probes that 
permit the user to detect, monitor, localize, or isolate 
nucleic acids and other molecules of scientific or clini- 
cal interest, even when present in only extremely small 
amounts. To date, radioactive materials have provided 
the most sensitive, and in many cases the only, means to 
perform many important experimental or analytical 
tests. There are, however, serious limitations and draw- 
backs associated with the use of radioactive com- 
pounds. First, since personnel who handle radioactive 
material can be exposed to potentially hazardous levels 
of radiation, elaborate safety precautions must be main- 
tained during the preparation, utilization, and disposal 
of the radioisotopes. Secondly, radioactive nucleotides 
are extremely expensive to purchase and use. in large 
pan due to the cost of equipment and manpower neces- 
sary to provide the appropriate safeguards, producer- 
/user health monitoring services, and waste-disposal 
programs. Thirdly, radioactive materials are often very 
unstable and have a limited shelf-life, which further 45 
increases usage costs. This instability results from radio- 
lytic decomposition, due to the destructive effects asso- 
ciated with the decay of the radioisotope itself, and 
from the fact that many isotopes (e.g. 32 P and l23 I) have 
half-lives of only a few days. 

It is known that haptens can combine with antibodies, 
but can initiate an immune response only if bound to a 
carrier. This property can be exploited in detection and 
identification testing. 

It is also known that biotin and iminobiotin strongly 
interact with avidin, a 68,000 dalton glycoprotein from 
egg white. This interaction exhibits one of the tightest, 
non-covalent binding constants (Krfu= 10 -13 ) seen in 
nature. If avidin is coupled to potentially demonstrable 
indicator molecules, including fluorescent dyes, e.g. 
fluorescein or rhodamine; electron-dense reagents, e.g. 
ferritin, hemocyanin. or colloidal gold; or enzymes 
capable or depositing insoluble reaction products, e.g. 
peroxidase or alkaline phosphatase, the presence, loca- 
tion, or quantity of a biotin probe can be established. 
Although iminobiotin binds avidin less tightly than 
biotin, similar reactions can be used for its detection. 
Moreover, the reversibility of the iminobiotin-avidin 
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interaction, by decreasing solution pH, offers significant 
advantages in certain applications. 

The specificity and tenacity of the biotin-avidin com- 
plex has been used in recent years to develop methods 
for visually localizing specific proteins, lipids, or carbo- 
hydrates on or within cells (reviewed by E.A. Bayer 
and M. Wilchek in Methods of Biochemical Analysis, 
26, 1, 1980). Chromosomal location of RNA has been 
determined by electron microscopy using a biotinized 
protein, cytochrome C, chemically cross-linked to 
RNA as a hybridization probe. The site of hybridization 
was visualized through the binding of avidin-ferritin or 
avidin-methacrylate spheres mediated by the avidin- 
biotin interaction. (J. E. Manning, N. D. Hershey, T. R. 
Broker, M. Pellegrini, H. K. Mitchell, and N. Davidson. 
Caromosoma, 53, 107, 1975; J. E- Manning, M. Pelle- 
grini, and N. Davidson, Biochemistry, 61, 1364, 1977; T. 
R. Broker, L. M. Angerer, P. H. Yen, N. D. Hersey, and 
N. Davidson, Nucleic Acid Res., 5, 363. 1978; A Sodja 
and N. Davidson, Nucleic Acid Res., 5, 383. 1978.) This 
approach to the detection of polynucleotide sequences, 
although successful in the specialized cases examined 
which were highly reitterated sequences, is not of gen- 
eral utility for analysis of polynucleotides present in 
single or low copy number. 

Moreover, methods for attaching chemical moieties 
to pyrimidine and purine rings are known. Several years 
ago a simple and rapid acetoxymercuranon reaction 
was developed for introducing covalently bound mer- 
cury atoms into the 5-position of the pyrimidine ring, 
the C-8 position of the purine ring or the C-7 position of 
a 7-deazapurine ring, both in nucleotides and polynu- 
cleotides. (R. M. K. Dale. D. C Livingston and D. C. 
Ward, Proc. Natl. Acad. Sci. U.S.A., 70, 2238, 1973; R. 
M. K. Dale. E. Martin. D. C. Livingston and D. C 
Ward, Biochemistry, 14; 2447, 1975.) It was also shown 
several years ago that organomercurial compounds 
would react with olefmic compounds in the presence of 
palladium catalvsts to form carbon-carbon bonds (R. F. 
Heck, J. Am. Chem. Soc., 90, 5518. 1968; R. F. Heck, 
Ibid., 90, 5526, 1968; R. F. Heck. Ibid., 90, 5531, 1968; 
R. F. Heck. Ibid.. 90, 5535, 1968; and R. F. Heck, J. Am. 
Chem. Soc. 91, 6707, 1969.) Bergstrom and associates 
(J. L. Ruth and D. E. Berstrom, J. Org. Chem., 43, 2870, 
1978; and D. E. Bergstrom and M. K. Ogawa, J. Am. 
Chem. Soc, 100, 8106, 1978) and Bigge, et al. (C F. 
Bigge, P. Kaiaritis, J. R. Deck and M. P. Mertes, J. Am. 
Chem. Soc., 102, 2033, 1980) have recently applied this 
reaction scheme in the synthesis of C-5 substituted py- 
rimidine nucleotide compounds. 

Finally, it is known that antibodies specific for modi- 
fied nucleotides can be prepared and used for isolating 
and characterizing specific constituents of the modified 
nucleotides. (T. W. Munns and M. K. Liszewski, 
Progress in Nucleic Acid Research and Molecular Biol- 
ogy, 24, 109, 1980.) However, none of the antibodies 
prepared to data against naturally occurring nucleotides 
have been shown to react with their nucleotide determi- 
nant when it exists in a double-stranded RNA or DNA 
duplex or when in DNA-RNA hybrid molecules. 

To circumvent the limitations of radioactively la- 
beled probes or previously utilized chemical and biolog- 
ical probes, a series of novel nucleotide derivatives that 
contain biotin, iminobiotin. lipoic acid, and other deter- 
minants attached covalently to the pyrimidine or purine 
ring have been synthesized. These nucleotide deriva- 
tives, as well as polynucleotides and coenzymes that 
contain them, will interact specifically and uniquely 
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with proteins such as avidin or antibodies. The interac- 
tion between modified nucleotides and specific proteins 
can be utilized as an alternative to radioisotopes for the 
detection and localization of nucleic acid components in 
many of the procedures currently used in biomedical 
and recombinant-DNA technologies. Methods employ- 
ing these modified nucleotide-protein interactions have 
detection capacities equal to or greater than procedures 
which utilize radioisotopes and they often can be per- 
formed more rapidly and with greater resolving power. 

These new nucleotide derivatives can be prepared 
relatively inexpensively by chemical procedures which 
have been developed and standarized as discussed more 
fully hereinafter. More significantly, since neither the 
nucleotide probes of this invention nor the protein rea- 
gents employed with them are radioactive, the com- 
pounds can be prepared, utilized, and disposed of, with- 20 
out the elaborate safety procedures required for radio- 
isotopic protocols. Moreover, these nucleotide deriva- 
tives are chemically stable and can be expected to have 
functional shelf-lives of several years or more. Finally, 2 5 
these compounds permit the development of safer, more 
economical, more rapid, and more reproducible re- ■ 
search and diagnostic procedures. 

SUMMARY OF THE INVENTION 

Compounds having the structure: 
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widely useful as probes in biomedical research and re- 
combinant DNA technology. 

Particularly useful are compounds emcompassed 
within this structure which additionally have one or 
more of the following characteristics: A is non- 
aromatic; A is at least C 5 ; the chemical linkage joining 
B and A includes an a-olefinic bond; A is biotin or 
iminobiotin; and B is a pyrimidine or 7-deazapurine. 

These compounds may be prepared by a process 
which involves: 

(a) reacting a compound having the structure: 



x-CH 2q 




H H/ 



with a mercuric salt in a suitable solvent under suitable 
30 conditions so as to form a mercurated compound hav- 
ing the structure: 



x-CH2 Q B — A 



x-CH 2n B-Hg- 
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wherein B represents a purine, deazapurine, or pyrim- 
idine moiety covalently bonded to the O'-position of 
the sugar moiety, provided that when B is purine or 
7-deazapurine, it is attached at the N^-position of the 
purine or deazapurine, and when B is pyrimidine, it is 43 
attached at the ^-position; 

wherein A represents a moiety consisting of at least 
three carbon atoms which is capable of forming a de- 
tectable complex with a polypeptide when the com- 
pound is incorporated into a double-stranded ribonu- 
cleic acid, deoxyribonucleic acid duplex, or DNA- 
RNA hybrid; 

wherein the dotted line represents a chemical linkage 55 
joining B and A, provided that if B is purine the linkage 
is attached to the 8-position of the purine, if B is 7- 
daazapurine, the linkage is attached to the 7-position of 
the deazapurine, and if B is pyrimidine, the linkage is 
attached to the 5-position of the pyrimidine; and 

wherein each of x, y, and z represents 



(b) reacting said mercurated compound with a chemi- 
cal moiety reactive with the — Hg+ portion of said 
mercurated compound and represented by the formula 
•**N, said reaction being carried out in an aqueous sol- 
vent and in the presence of K 2 PdCU under suitable 
conditions so as to form a compound having the struc- 
ture: 
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wherein N is a reactive terminal functional group or is 
A; and 

(c) recovering said compound as said modified nucle- 
otide when N is A, or when N is a reactive terminal 
group, reacting said compound with a compound hav- 
ing the structure M-A, wherein M represents a func- 
tional group reactive with N in an aqueous solvent 
under suitable conditions so as to form said modified 
nucleotide, which is then recovered. 

This invention also provides compounds having the 
structure: 
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wherein each of B, B\ and B" represents a purine, 
7-deazapurine, or pyrimidine moiety covalently bonded 
to the Opposition of the sugar moiety, provided that 
whenever B. B\ or B" is purine or 7-deazapurine, it is 
attached at the N^-position of the purine or 7-deazapu- 
rine, and whenever B, B\ or B" is pyrimidine, it is at- 
tached at the N 1 -position: 

wherein A represents a moiety consisting of at least 
three carbon atoms which is capable of forming a de- 
tectable complex with a polypeptide when the com- 



25 



DETAILED DESCRIPTION OF THE 
INVENTION 



Several essential criteria must be satisfied in order for 
30 a modified nucleotide to be generally suitable as a sub- 
stitute for a radioactively- labeled form of a naturally 
occurring nucleotide. First, the modified compound 
must contain a substituent or probe that is unique, Le., 
not normally found associated with nucleotides or 
pound is incorporated into a double-stranded duplex 35 polynucleotides. Second, the probe must react specifi- 
formed with a complementary ribonucleic or deoxyri- cally with chemical or biological reagents to provide a 
bonucleic acid molecule. sensitive detection system. Third, the analogs must be 

wherein the dotted line represents a chemical linkage relatively efficient substrates for commonly studied 
joining B and A, provided that if B is purine the linkage nucleic acid enzymes, since numerous practical appiica- 
is attached to the 8-position of the purine, if B is 7- 40 dons require that the analog be enzymaticaily metabo- 
deazapurine, the linkage is attached to the 7-position of lized, e.g. t the analogs must function as substrates for 
the deazapurine, and if B is pyrimidine, the linkage is nucleic acid polymerases. For this purpose, probe moi- 
attached to the 5-position of the pyrimidine; eties should not be placed on ring positions that steri- 

w herein z represents H — or HO — ; and cally, or otherwise, interfere with the normal Watson- 

wherein m and n represent integers from 0 up to 45 Crick hydrogen bonding potential of the bases. Other - 
about 100,000. wise, the substituents will yield compounds that are 

These compounds can be prepared by enzymatic inactive as polymerase substrates. Substitution at ring 
polymerization of a mixture of nucleotides which in- positions that alter the normal "anti" nucleoside confor- 
clude the modified nucleotides of this invention. Alter- mation also must be avoided since such conformational 
natively, nucleotides present in oligo- or polynucleo- 50 changes usually render necleotide derivatives unaccept- 
tides may be modified using chemical methods. able as polymerase substrates. Normally, such consider- 

Nucleotides modified in accordance with the prac- ations limit substitution positions to the 5-position of a 
tices of this invention and oligo- and polynucleotides pyrimidine and the 7-position of a purine or a 7- 



into which the modified nucleotides have been incorpo- 
rated may be used as probes in biomedical research, 
clinical diagnosis, and recombinant DNA technology. 
These various utilities are based upon the ability of the 
molecules to form stable complexes with polypeptides 
which in turn can be detected, either by means of prop- 



deazapurine. 

55 Fourth, the detection system should be capable of 
interacting with probe substituents incorporated into 
both single-stranded and double-stranded polynucleo- 
tides in order to be compatible with nucleic acid hybrid- 
ization methodologies. To satisfy this criterion, it is 



erties inherent in the polypeptide or by means of detect- 60 preferable that the probe moiety be attached to the 

able moieties which are attached to, or which interact purine or pyrimidine through a chemical linkage or 

with, the polypeptide. "linker arm" so that it can readily interact with antibod- 

Some uses include detecting and identifying nucleic ies, other detector proteins, or chemical reagents, 
acid-containing etiological agents, e.g. bacteria and Fifth, the physical and biochemical properties of 

viruses; screening bacteria for antibiotic resistance; di- 65 polynucleotides containing small numbers of probe 

agnosing genetic disorders, e.g. thalassemia and sickle substituents should not be significantly altered so that 

cell anemia; chromosomal karyotyping; and identifying current procedures using radioactive hybridization 

tumor cells. probes need not be extensively modified. This criterion 



5,328,824 



3 



must be satisfied whether the probe is introduced by 
enzymatic or direct chemical means. 

Finally, the linkage that attaches the probe moiety 
should withstand all experimental conditions to which 
normal nucleotides and polynucleotides are routinely 
subjected, e.g., extended hybridization times at elevated 
temperatures, phenol and organic solvent extraction, 
electrophoresis, etc. 

All of these criteria are satisfied by the modified nu- 
cleotides described herein. 

These modified nucleotides have the structure: 



x-CH: Q B -a 



10 



not useful. Presently preferred bases are cytosine, ura- 
cil, deazaadenine and deaza guanine. 

A may be any moiety which has at least three carbon 
atoms and is capable of forming a detectable complex 
with a polypeptide when the modified nucleotide is 
incorporated into a double-stranded duplex containing 
either deoxyribonucleic or ribonucleic acid. 

A therefore may be any ligand which possesses these 
properties, including haptens which are only immuno- 
genic when attached to a suitable carrier, but are capa- 
ble of interacting with appropriate antibodies to pro- 
duce complexes. Examples of moieties which are useful 
include: 



y * 



15 



wherein B represents a purine. 7-deazapunne. or py- 
rimidine moiety covalentiy bonded to the C 1 -position 
of the sugar moiety, provided that when B is punne or 
7^deazapurine, it is attached at the N^-position of the 
purine or 7-deazapurine, and when B is pynmidine, it is 
attached at the ^-position; 

wherein A represents a moiety consisting of at least 
three carbon atoms which is capable of forming a de- 
tectable complex with a polypeptide when the com- 
pound is incorporated into a double-stranded ribonu- 
cleic acid, deoxyribonucleic acid duplex, or DNA- 
RNA hybrid; 

wherein the dotted line represents a linkage group 
joining B and A, provided that if B is purine the linkage 
is attached to the 8-position of the purine, if B is 7- 
deazapurine, the linkage is attached to the 7-posmon of 35 
the deazapurine, and if B is pyrimidine, the linkage is 
attached to the 5-position of the pyrimidine; and 

wherein each of x, y and z represents 
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These compounds are widely useful as probes in bio- 
medical research and recombinant DNA technology. 

Although in principal all compounds encompassed 
within this structural formula may be prepared and used 
in accordance with the practices of this invention, cer- 
tain of the compounds are more readily prepared or 
used or both, and therefore are presently preferred. 

Thus, although purines, pyrimidines and 7-deazapu- 
rines are in principal useful, pyrimidines and 7-deazapu- 
rines are preferred since purine substitution at the 3- 
position tends to render the nucleotides ineffective as 
polymerase substrates. Thus, although modified purines 
are useful in certain respects, they are not as generally 
useful as pyrimidines and 7-deazap urines. Moreover, 
pyrimidines and 7-deazapurines useful in this invention 65 
must not be naturally substituted at the 5-or 7-positions. 
respectivelv. As a result, certain bases such as thymine, 
5-methvIcvtosine. and 5-hydroxymethyicytosine are 
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Of these the preferred A moieties are biotin and 
iminobiotin. . 

Moreover, since aromatic moieties tend to intercalate 
into a base-paired helical structure, it is preferred that 
the moiety A be nonaromatic. Also, since smaller moi- 
eties may not permit sufficient molecular interaction 
with polypeptides, it is preferred that A be at least C 5 so 
that sufficient interaction can occur to permit formation 
of stable complexes. Biotin and iminobioan satisfy both 
of these criteria. 

The linkage or group joining moiety A to base B may 
include any of the well known bonds including carbon- 
carbon single bonds, carbon-carbon double bonds, car- 
bon-nitrogen single bonds, or carbon-oxygen single 
bonds. However, it is generally preferred that the chem- 
ical linkage include an olefinic bond at the a-posmon 
relative to B. The presence of such an a-olefinic bond 
serves to hold the moiety A away from the base when 
the base is paired with another in the well known dou- 
ble-helix configuration. This permits interaction with 
polypeptide to occur more readily, thereby facilitating 
complex formation. Moreover, single bonds with 
greater rotational freedom may not always hold the 
moiety sufficiently apart from the helix to permit recog- 
nition by and complex formation with polypeptide. 
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It is even more preferred that the chemical linkage 
group be derived from a primary amine, and have the 
structure — CH2 — NH — , since such linkages are easily 
formed utilizing any of the well known amine modifica- 
tion reactions. Examples of preferred linkages derived 
from allyiamine and ally K3-amino-2-hydroxy- 1 -pro- 
pyl) ether groups have the formulae 

— CH=CH— CH 2 — NH— 
and 

-CH=CH— CT 2 — O— CH 2 — CH— CH 2 — NH— 
OH 



10 



15 



20 



respectively. 

Although these linkages are preferred, others can be 
used, including particularly olefin linkage arms with 
other modifiable functionalities such as thiol, carboxylic 
acid, and epoxide functionalities. 

The linkage groups are attached at specific positions, 
namely, the 5-position of a pyrimidine, the 3-position of 
a purine, or the 7-position of a deazapurine. As indi- 
cated previously, substitution at the 8-position of a pu- 25 
rine does not produce a modified nucleotide which is 
useful in all the methods discussed herein. It may be that 
the 7-position of a purine, which is occupied by a nitro- 
gen atom, could be the point of linkage attachment. 
However, the chemical substitution methods employed 30 
to data and discussed herein are not suitable for this 
purpose. 

The letters x. y, and 2 represent groups attached to 
the 5\ 3', and 2' positions of the sugar moiety. They may 
be any of 35 
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-continued 
— CH=CH— CH 2 — O— CH 2 — CH— CH 2 — NH— , 
OH 

and B is uracil or cytosine. 

The general synthetic approach adopted for intro- 
ducing the linker arm and probe moiety onto the base is 
discussed hereinabove. (See especially, J. L. Ruth and 
D. E. Bergstrom, J. Org. Chem., 43, 2870, 1978; D. E. 
Bergstrom and M. K. Ogawa, J. Amer. Chem. Soc. 100, 
8106, 1978; and C F. Bigge, P. Kalaritis, J. R. Deck and 
M. P. Mertes, J. Amer. Chem. Soc. 102, 2033, 1980.) 
However, the olefin substituents employed herein have 
not been used previously. To facilitate attachment of 
probe moiety A, it has been found particularly desirable 
to employ olefins with primary amine functional 
groups, such as allyiamine [AA] or allyl-{3-amino-2- 
hydroxy-1 -propyl) ether [NAGE], which permit probe 
attachment by standard amine modification reactions, 
such as, 
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Although conceivable, it is unlikely that all of x, y, and 
2 will simultaneously be the same. More likely at least 
one of x, y, and 2 will be a phosphate-containing group, 
- either mono-, di-, or tri-phosphate and at least one will 
be HO— or H— . As will be readily appreciated, the 
most likely identity of 2 will be HO — or H — indicating 
ribonucleotide or deoxyribo nucleotide, respectively. 
Examples of such nucleotides include 5'-ribonucleoside 
monophosphates, 5'-ribonucleoside diphosphates, 5'- 
ribonucleoside triphosphates. 5'-deoxyribonucleoside 
monophosphates, 5'-deoxyribonucleoside diphosphates, 
5'-deoxyribonucleoside triphosphates, 5'p-ribcnucleo- 
side-3'p, and 5'p-deoxyribonucleoside-3'p. More spe- 
cific examples include modified nucleotides of this type 
in which A is biotin or iminobiotin, the chemical linkage 
is 

— CH=CH— CH 2 — NH— 
or 



NH 2 
II 

— CH 2 NH 2 + R— C— OR- 
Imidate 



NH; 

->— CH 2 NHCR 
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II 
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NHS-ester (N-hydroxysuccinimide) 
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— CH 2 NH 2 + R — N=C=S- 
Iaothiocyanate 
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II 




-^— CH 2 NHCNHR 

OH 
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— CH 2 NHCH 2 CHR 



Because of ease of preparation it has been found prefera- 
60 ble to use NHS-esters for probe addition. However, 
olefin linker arms with other modifiable functional 
groups, such as thiols, carboxylic acids, epoxides, and 
the like, can also be employed. Furthermore, both 
linker arm and probe can be added in a single-step if 
deemed desirable. 

Specifically, modified nucleotides having the struc- 
ture: 
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wherein B represents a purine, 7-deazapurine, or py- 
rimidine moiery covalently bonded to the exposition 
of the sugar moiery, provided that when B is purine or 
7-deazapurine, it is attached at the N^-position of the 
purine or deazapurine, and when B is pyrimidine, it is 
attached at the N ^position; 

wherein A represents a moiety consisting of at least 
three carbon atoms which is capable of forming a de- 
tectable complex with a polypeptide when the com- 
pound is incorporated into a double-stranded ribonu- 
cleic acid, deoxyribonucleic acid duplex* DNA-RNA 
hybrid; 

wherein the dotted line represents a chemical linkage 
joining B and A, provided that if B is purine, the linkage 
is attached to the 8-position of the purine, if 7-deazapu- 
rine, the linkage is attached to the 7-posirion of the 
deazapurine, and if B is pyrimidine, the linkage is at- 
tached to the 5-position of the pyrimidine; and 

wherein each of x, y, and z represents 
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can be prepared by: 
(a) reacting a compound having the structure: 

x-CHz 0 B 




10 



15 



y * 



wherein N is a reactive terminal functional group or is 
A; and 

(c) recovering said compound as said modified nucle- 
otide when N is A, or when N is a reactive terminal 
group, reacting said compound with a compound hav- 
ing the structure M-A, wherein M represents a func- 
tional group reactive with N in an aqueous solvent 
under suitable conditions, so as to form said modified 
nucleotide, which is then recovered. 

The following schema is illustrative: 
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with a mercuric salt in a suitable solvent under suitable 
conditions so as to form a mercurated compound hav- 
ing the structure: 
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(b) reacting said mercurated compound with a chemi- 
cal moiety reactive with the -Hg~ portion of said mer- 
curated compound and represented by the formula 
•••N, said reaction being carried out in an aqueous sol- 
vent and in the presence of K 2 PdCU under suitable 
conditions so as to form a compound having the struc- 
ture: 



65 



Although the reactions can be carried out at hydro- 
gen ion concentrations as low as pH 1, or as high as pH 
14, it is preferred to operate in the range from about 4 to 
8. This is especially true when dealing with unstable 
compounds such as nucleoside polyphosphates, polynu- 
cleotides, and nucleotide conenzymes which are hydro- 



13 



5,328,824 - 



lyzed at pH*s outside this range. Similarly, it is preferred 
to operate at a temperature in the range from about 20* 
C to 30* C. to avoid possible decomposition of labile 
organic substrates. However, the reactions can be car- 
ried out at temperatures from about 5 m C to 100* C As 
is usual with chemical reactions, higher temperatures 
promote the reaction rate and lower temperatures re- 
tard it. Thus, in the temperature range from 5* C to 
100* C, the optimum reaction time may vary from 
about 10 minutes to 98 hours. In the preferred tempera- 
ture range, reacrion times normally vary from about 3 
to 24 hours. 

The preferred procedure for maintaining the pH in 
the desired range is through the use of buffers. A variety 
of buffers can be employed. These include, for example, 
sodium or potassium acetate, sodium or potassium ci- 
trate, potassium citrate-phosphate, tris-acetate and bo- 
rate-sodium hydroxide buffers. The concentration of 
buffer, when employed, can vary over a wide range, up 
to about 2.0 molar. 

While a particular advantage of the mercurarion and 
palladium catalyzed addition reactions is that they can 
be carried out in water, small amounts of an organic 
solvent can be usefully included as a solubility aid. The 
organic solvents usually chosen are those which are 
miscible with water. These may be selected from ethers, 
alcohols, esters, ketones, amides, and the like such as 
methanol, ethanol, propanoi, glycerin, dioxane, ace- 
tone, pyridine and dimethylformamide. However, since 
it has been observed that the presence of alcohols, such 
as methanol, often results in aikoxy-addition across the 
olefin double bond, any organic solvent used as a solu- 
bility aid should be chosen carefully. Introduction of 
alkoxy substituents to the a - or 0- exocyclic carbon 
atoms often results in the production of compounds 
which are utilized much less efficiently as enzyme sub- 
strates. 

Although various mercuric salts may be utilized, the 
presently preferred salt is mercuric acetate. Also, as 
indicated previously, the compounds may be prepared 
by first adding a linker arm and then the moiety A, or by 
adding a linker arm to which A is already attached. 
Thus, the chemical moiety represented by the formula 
•••N may be any one of the numerous entities which 
ultimately result in production of the desired com- 
pounds. 

Examples include — CH = CH— CH2— NH 2 , 
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CH=CH CHi — O — CHj — CH — CH2 — NH2, 
OH 

— CH=CH— CH 2 — NH-biorin. 
and 

— CH=CH— CH 2 — O— CH 2 — CH— CH : — NH-iittinobiorin 
OH 

The amounts of the reactants employed in these reac- 
tions may vary widely. However, in general the 
amounts of unmercurated compound, mercurated com- 
pound, and palladium-containing compound will be 
substantially stoichiometric whereas the mercuric salt 
and compound •••N will be present in molar excess, 
e.g. 5-20 moles of •••N or of mercuric salt per mole of 
mercurated compound or unmercurated compound, 
respectively. In practice, amounts will vary depending 
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upon variations in reaction conditions and the precise 
identity of the reactants. 

Having the biotin probe directly attached to nucleo- 
tide derivatives that are capable of functioning as en- 
zyme substrates offers considerable versatility, both in 
the experimental protocols that can be performed and in 
the detection methods (microscopic and non-micro- 
scopic) that can be utilized for analysis. For example, 
biotin nucleotides can be introduced into polynucleo- 
tides which are in the process of being synthesized by 
cells or crude cell extracts, thus making it possible to 
detect and/or isolate nascent (growing) polynucleotide 
chains. Such a procedure is impossible to do by any 
direct chemical modification method. Furthermore, 
enzymes can be used as reagents for introducing probes 
such as biotin into highly selective or sitespeciiic loca- 
tions in polynucleotides; the chemical synthesis of simi- 
lar probe-modified products would be extremely diffi- 
cult to achieve at best. 

The synthesis of nucleotides containing biotin or 
iminobiotin was achieved as detailed in the examples set 
forth hereinafter. Pyrimidine nucleoside triphosphates 
containing either of these probes attached to the C-5 
carbon atom were good to excellent substrates for a 
wide variety of purified nucleic acid polymerases of 
both prokaryotic and eukaryotic origin. These include 
DNA polymerase I of R colt, bacteriophage T4 DNA 
polymerase, DNA polymerases a and P from murine 
(A-9) and human (HeLa) cells, and the DNA polymer- 
ase of Herpes simplex virus. Confirming data were ob- 
tained with R colt DNA polymerase I using either the 
nicktransiation condition of Rigby, et al. (P. W. J. 
Rigby, M. Dieckmann, C. Rhodes and P. Berg, J. Moi. 
Biol. 113. 237, 1977) or the gap-filling reaction de- 
scribed by Bourguignon et ai. (G. J. Bourguignon, P. J. 
Tattersall and D. C. Ward, J. Virol. 20, 290, 1976). 
Bio-dUTP has also been found to function as a polymer- 
ase substrate both in CHO cells, permeabilized by treat- 
ment with lysoiecithin according to the method of 
Miller, et ai. (M. R. Miller, J. C. Castellot, Jr. and A. B. 
Pardee, Exp. Cell Res. 120, 421, 1979) and in a nuclear 
replication system prepared from Herpes simplex in- 
fected BHK cells. Although biotinyl ribonucleoside 
triphosphates were found to function as substrates for 
the RNA polymerases of R colt and bacteriophage T7, 
they are not utilized as efficiently as their deoxyribonu- 
cleotide triphosphate counterparts. Indeed, they are 
incorporated poorly, if at all, by the eukaryotic RNA 
polymerases examined (HeLa cell RNA polymerase III, 
calf thymus RNA polymerase II and mouse ceil RNA 
polymerase II). While this limited range of substrate 
function does restrict the utility in some in vivo or in 
vitro transcription studies, biotin- labeled RNA probes 
can be prepared enzymatically from DNA templates 
using R coli or T7 RNA polymerases or by 3' endlabel- 
ing methods using RNA ligase with compounds such as 
biotinyl-pCp. The AA- and NAGE-derivatives of UTP 
are, however, substrates for the eukaryotic RNA 
polymerases mentioned above. With the availability of 
antibodies to these analogs, the isolation of nascent 
transcripts by immunological or affinity procedures 
should be feasible. 

The enzymatic polymerization of nucleotides con- 
taining biotin or iminobiotin substituents was not moni- 
tored directly, since neither of these probes were radi- 
oiabled. However, two lines of experimental evidence 
clearly show that the biotinyl-nucleotides were incor- 
porated. The first is that polynucleotides synthesized in 
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TABLE II 
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15 



10 aiM Tra-HCL 8.3 
50mM N»C1 

(1) aiMN»a 

(2) 1.0MN»a 

(3) 8M Ure* 

(4) 6M guwiidine-HCl 

(5) 50 oM N*ii-acet»te, 
pH 4.0 

(6) 50 mM NH*-«=ettw, 
pH 4.0 

2 mM btoun 



8.7 

<ai 

<0.0l 
<0.01 

<aoi 
<aoi 

<o.oi 



100 

100 

100 
97.3 
97.0 

<0.01 

<aoi 



99.7 

99.7 
994 
98.5 
97.0 
96.3 

<0.01 
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■"SELECTIVE RETENTION F BIOTINLZtD UNA 
SELECT! ^ AV/Tr>lK . SEPHARO sE 



T-DNA 
T-DNA 
T-DNA 
Bio-DNA 
Bio-DNA 
Bio-DNA 



Ana-Bio IgG 
Non-immune IgG 

Anti-Bia IgG 
Non-immune IgG 



70 
87 
55 
53 
3347 
60 



4867 
5197 
5107 
3886 
736 
3900 



Eluem 



% DNA detained on Resin _ 
Bin-PSA (1%) T-DNA 



30 



Load 



3 X 10* cpm 
10 mM Tris 7.5 



100 



•NT pBR-322 DNA.>^ !% Btoon «bsutuoon. 
Soeofic icnvuy. 2 X 10* cpm/ug 
Skjoa detection 0.001-0-01 pmoie*. 

Thus, it is possible to prepare novel compounds having 
the structure: 




-0-CH Jo .-A 



-P— 
I 

OH 




OH 



0.2M N»Q 

(1) 0.5M N«a 

(2) 1.0M Nad 

(3) 8M Urea 

(4) 6M guanidine-HCl 

(5) 99% fonnamidc 

(6) 2 mM Btoun 

(7) 50 mM NaOH 



100 
99.7 

100 
95.2 
9*.7 
97.6 
89.5 



0.1 
<0.0l 
<0.01 
<0.01 
<0.0t 
<0.01 
<0.01 



.tod. =«h o, B. zj*r"5SZ£ & 

whenever o.o,orfl » F - ltr i ne or deazapu- 

attached at the *1»« W %?J 
rine, and whenever B, a , or n » pj 
uched at the N'-posmon; Qf ^ ^ 

65 wherein A ^^^S^ of forming a de- 
three carbon ™f ^~L ide when the com- 
• ISSf h a £££5 mt^uble-stranded duple, 
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formed with a complementary ribonucleic or deoxyri- 
bonucleic acid molecule. 

wherein the dotted line represents a linkage group 
joining B and A, provided that if B is purine, the linkage 
is attached to the 8-position of the purine, if B is 7- 
deazapurine, the linkage is attached to the 7-posirion of 
the deazapurine, and if B is pyrimidine, the linkage is 
attached to the 5-position of the pynmidine; 

wherein z represents H — or HO — ; and 

wherein m and n represent integers from 0 up to 
about 100,000. 

Of course, it should be readily understood that in 
general m and n will not simultaneously be 0 since, in 
that event, the compound becomes merely a modified 
nucleotide as described previously. In general B' and B" 
will vary within the same oligo- or polynucleotide, 
being alternatively uracil, cytosine, thymine, guanine, 
adenine, or the like. Also, in general, the variation will 
correspond to the ordered sequence of nucleotides 
which codes for the synthesis of peptides according to 
the well known Genetic Code. However, it is intended 
that the structure shown also embrace polynucleotides 
such as poly C, poly U, poly r(A-U), and poly d(A-U) as 
well as calf thymus DNA, ribosomal RNA of E. coli or 
yeast, bacteriophage RNA and DNA (R17, fd), animal 
viruses (SV40 DNA), chromosomal DNA, and the like, 
provided only that the polynucleotides be modified in 
accordance with this invention. 

It is also to be understood that the structure embraces 
more than one modified nucleotide present in the oligo- 
mer or polymer, for example, from two to thirty modi- 
fied nucleotides. The critical factor in this regard is that 
the number of modifications not be so great that the 
polynucleotide is rendered ineffective for the intended 
use. 

Finally, it should be understood that modified oligo- 
and polynucleotides can be joined to form larger enti- 
ties having the same structure so long as terminal 
groups are rendered compatible or reactive. 

These compounds can be made by enzymatic poly- 
merization of appropriate nucleotides, especially nucle- 
otide triphosphates in the presence of a nucleic acid 
template which directs synthesis under suitable condi- 
tions. Such conditions can vary widely depending upon 
the enzyme employed, amounts of nucleotides present, 
and other variables. Illustrative enzymes include DNA 
polymerase I of £. coli, bacteriophage T4 DNA poly- 
merase, DNA polymerases a and 0 from murine and 
human (HeLa) cells, DNA polymerase from Herpes 
^simplex virus, RNA polymerase of & coli, RNA poly- 
merase of bacteriophage T7, eukaryotic RNA polymer- 
ase including HeLa cell RNA polymerase III, calf thy- 
mus RNA polymerase II, and mouse ceil RNA poly- 
merase II. 

Also, the compounds can be prepared by terminal 
addition to oligo- or polynucleotides to produce com- 
pounds in which m or n is 0 depending upon whether 
the addition is at the 5' or 3' position. Moreover, the 
compounds such as pCp or pUp in which the base is 
biotinized can be added to existing molecules employ- 
ing the enzyme RNA ligase. 

Modified oligo- and polynucleotides can also be pre- 
pared by chemical modification of existing oligo- or 
polynucleotides using the approach described previ- 
ously for modification of individual nucleotides. 

The various modified nucleotides, oligonucleotides, 
and polynucleotides of this invention may be detected 
by contacting the compounds with polypeptides which 
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are capable of forming complexes therewith under suit- 
able conditions so as to form the complexes, provided 
that the polypeptides include one or more moieties 
which can be detected when the complex or complexes 
is or are formed, generally by means of conventional 
detection techniques. 

One polypeptide detector for the biotinyl-type probe 
is avidin. The avidin-biotin interaction exhibits one of 
the tightest non-covalent binding constant 
(Krfu= 10 s * 13 ) seen in nature. If avidin is coupled to 
potentially demonstrable indicator molecules, e.g., fluo- 
rescent dyes (fluorescein, rhodamine), electron-dense 
reagents (ferritin, hemocyanin, colloidal gold), or en- 
zymes capable of depositing insoluble reaction products 
(peroxidase, alkaline phosphatase) the presence, loca- 
tion and/or quantity of the biotin probe can be estab- 
lished. 

Avidin has, unfortunately, one property that makes it 
less desirable as a biotin-indicator protein when used in 
conjunction with nucleic acids or chromatin material. It 
has been reported (M. H. Heggeness, Stain Technol., 
52, 165, 1977; M. H. Heggeness and J. F. Ash, J. Cell. 
Biol., 73, 783, 1977; E. A. Bayer and M. Wilchek, Meth- 
ods of Biochemical Analysis 26, 1, 1980) that avidin 
binds tightly to condensed chromatin or to subcellular 
fractions that contain large amounts of nucleic acid in a 
manner which is independent of its bio tin-binding prop- 
erty. Since avidin is a basic glycoprotein with a pi of 
10.5, its histone-like character or its carbohydrate moi- 
eties are most likely responsible for these observed non- 
specific interactions. 

A preferred probe for biotm-containing nucleotides 
and derivatives is streptavidin, an avidin-like protein 
synthesized by the soil organism Streptomyces avidinii. 
Its preparation and purification is described in Hoffman, 
et al, Proc. Natl. Acad. Sci., 77, 4666 (1980). Streptavi- 
din has a much lower pi (5.0), is non-glycosyiated, and 
shows much lower non-specific binding to DNA than 
avidin, and therefore offers potential advantages in 
applications involving nucleic acid detection methodol- 
ogy* 

A most preferred protein for biotin-like probe detec- 
tion is monospecific rabbit IgG, antibiotin immunoglob- 
ulin. This compound was prepared by immunizing rab- 
bits with bovine serum albumin conjugated biotin as 
described previously (M. Berger, Methods in Enzymol- 
ogy, 62, 319 [1979]) and purified by affinity chromatog- 
raphy. Although the association constant of immuno- 
globulin-haptens have values of Kaon (I0 6 to I0 10 ) 
which are considerably lower than for avidin-biotin 
complexes, they are substantially equivalent to those 
observed with the avidm-iminobiotin complex. Further- 
more, the anti-biotin antibodies have proven extremely 
useful in detecting specific polynucleotide sequences on 
chromosomes by in situ hybridization since little, if any, 
non-specific binding of the antibody to chromatin mate- 
rial occurs. 

The modified polynucleotides of this invention are 
capable of denaturation and renaturation under condi- 
tions compatible with their use as hybridization probes. 
An analysis of the thermal denaturation profiles and 
hybridization properties of several biotin-substituted 
DNA and RNA polymers clearly indicates this. For 
example, pBR 322 DNA or \ DNA, nick translated to 
introduce approximately 10-100 biotin residues per 
kilobase, have Tm values essentially identical to that of 
the control, biotin-free DNAs. Furthermore. 32 p- 
labeled, biotin-substituted, pBR 322 DNA, exhibited the 
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same degree of specificity and autoradiographic signal 
intensity as control, thyrnidine-containing DNA, when 
used as a hybridization probe for detecting bacterial 
colonies containing the plasmid. 

In DNA duplexes, such as MVM RF DNA, in which 
every thymidine residue in one strand (1250 in toto per 
5 Kb) is replaced by a biotinyl-nucleotide, the Tin is 
only 5" C. less than that of the unsubstimted control. 
Although the Tm of poly d(A-bioU) in which each base 
pair contains a bio-dUMP residue is 15* C. lower than 
the poly d(A-T) control, the degree of cooperarivity 
and the extent of hyperchromicity observed both dur- 
ing denaturation and renaturation were the same for the 
two polymers. A parallel analysis of RNA duplexes and 
DNA/RNA hybrids indicates that their TnVs also de- 
crease as the biotin-content of the polymer increases. 
However, it is clear that a substantial number of biotin- 
molecuies can be introduced without significantly alter- 
ing the hybridization characteristics of the polymers. 

These results strongly suggested that biotin-sub- 
stituted polynucleotides could be used as probes for 
detecting and/or localizing specific polynucleotide se- 
quences in chromosomes, fixed ceils, or tissue sections. 
The general protocol for detecting the biotin-sub- 
stituted probe is schematically illustrated as follows: 

GENERAL PROTOCOL FOR PROBE DETECTION 
VIA IN SITU, COLONY. OR NORTHERN/SOUTHERN 
HYBRIDIZATION METHODS 



Ann probe sequence 



1) Target 
Delivery 



Hybridize with biotinized or 
haptemzed probe (with or with- 
out cloning vehicle sequences) 



2) Signal 

Amplification 



6 6 6 




V 



1) Avidin-peroxidase 

2) I gG- peroxidase 

3) Primary a-determinent IgG 




6 6 6 
Y y V 

1 ® ® ® 



(2) 



(3) 




3) Detection: I) Insoluble peroxidase products: DAB 

2} Antibody sandwiching techniques 
-^^^ m Bioon or Hapten 

This general scheme illustrates only procedures used 
for gene mapping (cytogenetics), and recombinant 
DNA-technologies. However, it can be equally well 
applied to the detection of nucleic acid sequences of 
bacterial, viral, fungal or parasite origin in clinical sam- 
ples and this forms the basis of a powerful new ap- 
proach to clinical diagnostics which does not rely on 
the use of radioisotopes. 



Immunological and histochemical methods for the 
detection of biotin have shown that the basic approach 
is useable for a rapid method of gene mapping in siru 
hybridization and nonradioactive procedures for de- 
5 tecting specific nucleic acid sequences by blotting hy- 
bridization methods. Use may be made of this technol- 
ogy in development of new clinical diagnostic proce- 
dures. 

Using this approach, it is possible to determine the 

10 presence of a specific deoxyribonucleic or ribonucleic 
acid molecule, particularly such a molecule derived 
from a living organism, e.g. bacteria, fungus, vims, 
yeast, or mammal This in turn permits diagnosis of 
nucleic acid-containing etiological agents in a patient or 

15 other subject. 

Moreover, it provides a method for screening bac- 
teria to determine antibiotic resistance. Thus, for exam- 
ple, penicillin resistance in Streptococcus pyogenes or 
Neisseris meningitidis; tetracycline resistance in Staphy- 

20 lococcus aureus, Candida albicans. Pseudomonas aerugi- 
nosa. Streptococcus pyogenes, or Neisseria gonorrhoeae; 
and aminoglycoside resistance in Mycobacterium tuber- 
culosis can be determined. 

In these methods a polynucleotide is prepared which 

25 is complementary to the nucleic acid sequence which 
characterizes the organism or its antibiotic resistance 
and which additionally includes one or more modified 
nucleotides according to this invention. This polynucle- 
otide is hybridized with nucleic acid obtained from the 

30 organism under scruntiny. Failure to hybridize indicates 
absence of the organism or of the resistance characteris- 
tic. Hybridized nucleic acid duplexes are then identified 
by forming a complex between the duplex and a suitable 
polypeptide which carries a detectable moiety, and 

35 detecting the presence of the complex using an appro- 
priate detection technique. Positive detection indicates 
that the complex, the duplex and therefore the nucleic 
acid sesequence of interest are present. 

This approach can be extended to the diagnosis of 

40 genetic disorders, such as thalassemia and sickle cell 
anemia. The deoxyribonucleotide acid gene sequence 
whose presence or absence (in the case of thalassemia) is 
associated with the disorder can be detected following 
hybridization with a polynucleotide probe according to 

45 this invention based upon complex formation with a 
suitable detectable polypeptide. 

The mapping of genes or their transcripts to specific 
loci on chromosomes has been a tedious and time-con- 
suming occupation, involving mainly techniques of 

50 cellfusion and somatic ceil genetics. Although in situ 
hybridization has been employed successfully for map- 
ping single-copy gene sequences in species that undergo 
chromosomes polytenization, such as Drosophila, de- 
tection of unique sequence genes in most higher eukary- 

55 otic chromosomes has been extremely difficult, if not 
impossible, using standard hybrization methods. The 
necessity for polynucleotide probes of very high spe- 
cific radioactivity to facilitate autoradiographic local- 
ization of the hybridization site also results in rapid 

60 radiodecomposirion of the probe and a concomitant 
increase in the background noise of silver grain deposi- 
tion. The use of hybridization probes with low to mod- 
erate specific radioactivities requires exposure times of 
many days or weeks, even to detect multicopy sequen- 

65 ces, such as ribosomal RNA genes or satellite DNA. 
Since recombinant DNA technology has made feasible 
the molecular cloning of virtually every single-copy 
sequence found in eukaryotic cells, it would be ex- 
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bridized with biotinized DNA probes will be incubated 
with streptavidin followed by a subsequent incubation 
with bovine serum albumin which has been doubly 
labeled with biotin and FTTC (FITC biotinyl-BSA). 
Since only one of the four streptavidin subunits is likely 
to be involved in binding at each biotinized DNA site, 
potentially one labeled BSA molecule can bind to each 
of the remaining three nonconjugated subunits of the 
streptavidin-biotinyl nucleotide complex. The fluores- 
cence signal from this single strep tavidin+FITC bi- 
otinylBSA layer will be compared with a control using 
the basic "antibody sandwich method" described ear- 

the "antibody sandwich" and streptavidin FTTC 
biotinyi-BSA detection intensities are comparable, one 
can attempt to enhance the streptavidin + FITC, bioti- 
nyl-BSA system to single-copy copy sensitivity in a 
manner that parallels the multiple "haptene-antibody 
sandwich" approach. Since some of bioun groups on 
BSA will not be bound to the first layer of streptavidin. 
a second layer of streptavidin can be added until suffi- 
cient signal is obtained. For example, if in the second 
layer only two streptavidin protomers bind to each 
first-layer BSA and each of these streptavidin protom- 
ers binds three FITC-biotinyl BSA molecules, then the 25 
second laver intensity will be twice as great as that from 
the first layer; for the third layer, with analogous bind- 
ing stoichiometrics, the fluorescent intensity will be 
12-fold that of the first layer, so the total intensity will 
rapidly increase with successively added layers. There 
are plans to use a larger carrier protein such as thyro- 
globulin rather than BSA in order to maximize amounts 
of attached fluorescent and biotin probes. It may also be 
necessary to use a longer linker arm between the biotin 
probe and the carrier protein. A longer linker arm 
should stericaily optimize the theoretical delivery of a 
biotinized fluorescent carrier molecule to each noncon- 
jugated streptavidin subunit and maximize the number 
of streptavidin protomers in the subsequent layer which 
will bind to the biotinized fluorescent carrier. As be- 
fore, appropriate controls will be done to insure that 
substitution of the carrier protein with fluorescent 
probes and biotin does not cause solubility and/or non- 
specific binding problems. 

The streptavidin-carrier protein delivery system has 45 
two significant advantages over the immunofluorescent 
approach in addition to its speed of delivery. First, only 
two protein components are needed to form the layers. 
Second, only the carrier protein needs to be modified 
and it is not necessary to maintain functional or even 50 
total structural integrity as long as the bioun groups are 
accessible to streptavidin. 

An alternative to the fluorescence method for visual- 
izing hybridized probes is to direct enzymes such as 
peroxidase, alkaline phosphatese of £-galactosidase to 55 
the hybridization site where enzymatic conversion of 
soluble substrates to insoluble colored precipitates per- 
mits light microscope visualization. The important ad- 
vantage of this technique is that the histochemical meth- 
ods are 10 to l<X>fold more sensitive than fluorescence 60 
detection. In addition, the colored precipitates do not 
bleach with extensive light exposure thus avoiding one 
of the general disadvantages of fluorescent light micros- 
copy. These enzymes can be coupled to the final anti- 
body instead of fluorescent probes in the "haptene-anti- 
body sandwich" technique using Afunctional reagents 
such as glutaraldehyde or in the case of peroxidase via 
oxidation of the peroxidase carbohydrate moieties to 
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aldehydes and coupling of these residues with £-amino 
groups of the desired protein. For the strep tavidin-bioti- 
nized carrier protein method, an enzyme with biotinyl 
groups coupled to it could replace a fluorescently-bioa- 
oized carrier system. Alternately, the enzyme could be 
coupled via biotin to the last layer of streptavidin with 
amplification of streptavidin sites being built up in pre- 
ceding layers using biotinized BSA or thyrogiobuhn. 
We will begin developing the necessary histochemical 
reagents and the appropriate substrate/insoluble prod- 
uct combinations for visualizing in situ hybridizations 
without background problems in the near future. The 
histochemical approaches to signal amplification should 
therefore be ready for trial in the summer of 1981. 

Detecting and/or imaging very low levels of fluores- 
cent light is possible using currendy available image 
intensifies or systems composed of lasers and photo- 
multipliers. These methods permit the detection or light 
down to the level of individual photons. With suitable 
digital processing systems, images can be produced in 
which each point, i.e. each pixel, of the image is strictly 
proportional to the number of photons emitted by a 
point at the object. Using systems of this kind or flow 
systems in which the cells or parts of cells flow past a 
laser beam, one can obtain detection sensitivity in- 
creases for fluorescent material of factors between 100 
and 1000 bevond that which can be detected by the eye. 
This increase is sufficient to detect the fluorescence of 

single copy genes. jtt-to a 

In a preferred modification, analogs of db 1 ¥ ana 
UTP that contain a biotin molecule covalentiy bound to 
the C-5 position of the pyrimidine ring through an allyi- 
amine linker arm have been synthesized. These biotinyl- 
nucleotides are efficient substrates for a variety of DNA 
and RNA polymerases in vitro. DNA containing low 
levels of biotin substitution (50 molecules or less/kilo- 
base) has denaturation, reassociation and hybridization 
characteristics which are indistinguishable from that ot 
unsubstituted control DNA. 

Thus, this invention also provides a method of chro- 
mosomal karyotyping. In this method, modified polynu- 
cleotides are prepared which correspond to known 
genes and include modified nucleotides. These polynu- 
cleotides are hybridized with chromosomal deoxyribo- 
nucleic acid and the resulting duplexes contacted with 
appropriate polypeptides under suitable conditions to 
permit complex formation. The polypeptides include 
detectable moieties so that the location of the complexes 
can be determined and the location of specific genes 

thereby fixed. 

Another embodiment of this invention involves de- 
tection of poly A-containing sequences using poly U in 
which some of the uracil bases have been modified to 
coritain a probe. Yet another embodiment involves cyc- 
lic modified nucleotides in which two of x, y, and are 
reacted to form the cyclic moiety 
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Such cyclic modified nucleotides may then be used to 
identify hormone receptor sites on cell surfaces which 
in turn can be used as a method of detecting cancer or 
rumor cells. 
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Finally, tumor ceils can be diagnosed by preparing 
polynucleotides which are modified according to this 
invention and are complementary to the messenger 
ribonucleic acid synthesized from a deoxyribonucleic 
acid gene sequence associated with the production of 
polypeptides, such as a- fetal protein or carcinoembry- 
onic antigen, the presence of which is diagnostic for 
specific tumor ceils. Hybridization and detection of 
hybrid duplexes thus would provide a method for de- 
tecting the tumor cells. 

The examples which follow are set forth to illustrate 
various aspects of the present invention but are not 
intended to limit in any way its scope as more particu- 
larly set forth in the claims. 

EXAMPLE 1 AND 2 

Synthesis of biotinyl-UTP and biotinyl-dUTP 

a) Preparation of Mercurated Nucleotides 
UTP (570 mg, 1.0 mmole) or dUTP 554 mg, 1.0 
mmoie) was dissolved in 100 ml of 0.1 M sodium acetate 
buffer pH 6.0, and mercuric acetate (1.59 gm. 5.0 
mmoles) added. The solution was heated at 50* C. for 4 
hours, then cooled on ice. Lithium chloride (392 mg, 9.0 



7.5. The desired product was in the major UV-absorb- 
ing portion which eiuted between 0.30 and 0.3 5M salt. 
Spectral analysis showed that this peak contained sev- 
eral products, final purification was achieved by reverse 
phase - HPLC chromatography on columns of P arris il- 
ODS2, using either 0.5M NH4H2PO4 buffer at pH 3.3 
(analytical separations), or 0.5M triethyiammonium 
acetate at pH 4.3 (preparative separations) as eluents. 
The 5'-triphosphates of 5-(3-aminopropen-l-yi) uridine 
(the allyiamine adduct to uridine) were the last portions 
to be eiuted from the HPLC column and they were 
clearly resolved from three, as yet uncharacterized, 
contaminants. These nucleotides were characterized by 
proton NMR elemental analysis AA-dUTP (Q2H16 N3 
O14P3 Na4.1 H 2 0): theory C, 22.91; H, 2.88; N, 6.68; P, 
14.77. Found, C 23.10; H, 2.85; N; 6.49; P, 14.75. AA- 
UTP (C12 Ht 6 N 3 Ou P 3 Na4. 4H z O): Theory, C 20.61; 
H. 3.46; N, 6.01; P, 13.3. Found C, 20.67; H, 4.11; N, 
5.39; P, 13.54] spectrally and chromatographicaily. 
c) Biotination of AA-dUTP or AA-UTP 
Biotinyl-N-hydroxysuccinimide ester (NHSB) was 
prepared from biotin (Sigma) as described previously 
(H. Heitzmann and F. M. Richards, Proc. Natl. Acad 
mmoles) was added and the solution extracted six tunes ^ USA . lx% 3537 [1974]) AA-dUTP.H 2 0 (63 me 0 1 

with an equal volume of ethyl acetate to remove excess 25 „ A A t-t-™ ^ tt ^ « . . 

HgCl 2 . The efficiency of the extraction process was 
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monitored by estimating the mercuric ion concentration 
in the organic layer using 4, 4'-bis (dimethylamino)-thi- 
obenzophenone (A. N. Christopher, Analyst, 94, 392 
(1969). The extent of nucleotide mercuration. deter- 
mined spectro photometrically following iodination of 
an aliquot of the aqueous solution as described by Dale 
et ai. (R. M. K. Dale. D. C. Ward, D. C. Livingston, and 
E. Martin, Nucleic Acid Res. 2, 915 [1975]), was rou- 
tinely between 90 and 100%. The nucleotide products 35 
in the aqueous layer, which often became cloudy during 
the ethyl acetate extraction, were precipitated by the 
addition of three volumes of ice-cold ethanol and col- 
lected by centrifugacion. The precipitate was washed 
twice with cold absolute ethanol, once with ethyl ether, 
and then air dried. These thus prepared mercurated 
nucleotides were used for the synthesis of the ailyla- 
mine-nucleotides without further purification, 
b) Synthesis of allylaraine-dUTP and ailylamine-UTP 
The mercurated nucleotides (of step a) were dis- 
solved in 0. 1M sodium acetate buffer at pH 5.0, and 
adjusted to a concentration of 20 mM (200 OD/mi at 267 
nm). A fresh 2.0 M solution of allyiamine acetate in 



mmole) or AA-UTP.4H2O (70 mg, 0.1 mmoie) was 
dissolved in 20 ml of 0.1 M sodium borate bufTer at pH 
8.5, and NHSB (34.1 mg, 0.1 mmole) dissolved in 2 mi 
of dimethyl formamide. was added. The reaction mix- 
ture was left at room temperature for four hours and 
then loaded directly onto a 30 ml column of DEAE- 
Sephadex TM A-25, preequiiibrated with 0.1M TEAB 
at pH 7.5. The column was eiuted with a 400 ml linear 
gradient (0.1-0.9M) of TEAB. Fractions containing 
biotinyl-dUTP or biotinyl-UTP, which eiuted between 
0.55 and 0.65M TEAB, were desalted by rotary evapo- 
ration in the presense of methanol and redissolved in 
water. Occassionally a slightly cloudy solution was 
obtained: this turbidity, due to a contaminant in some 
4Q TEAB solutions, was removed by filtration through a 
0.45 mm filter. For long term storage, the nucleorides 
were convened to the sodium salt by briefly stirring the 
solution in the presence of Dowex TM 50 (Na^ form). 
After filtration the nucleotide was precipitated by the 
addition of three volumes of cold ethanol, washed with 
ethyl ether, dried in vacuo over sodium hydroxide pel- 
lets, and stored in a dessicator at — 20* C. For immedi- 
ate use, the nucleotide solution was made 20 mM in 
Tris-HCl at pH 7.5, and adjusted to a final nucleotide 
mM. Stock solutions were stored 
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aqueous acetic acid was prepared by slowly adding 1.5 

ml of allyiamine (13.3 mmoles) to 8.5 ml of ice-cold 4M 50 concentration of 5 
acetic acid. Three mi (6.0 mmoles) of the neutralized frozen at —20* C. 

allyiamine stock was added to 25 ml (0.5 mmoie) of Elemental analysis of the bio-dUTP and bio- UTP 
nucleotide solution. One nucleotide equivalent of ; products yielded the following results. Bio-dUTP (Cu 
K 2 PdCLu (163 mg. 0.5 mmoie), dissolved in 4 ml of , H30N5O18P3S1 Na*.l H 2 0). Theoretical; C, 29.80; H. 
water, was then added to initiate the reaction. Upon 55 3.38; N, 7.89; P, 10.47; S. 3.61. Found; Q 30.14 H, 3.22; 



addition of the palladium salt (Alfa-Ventron) the solu- 
tion gradually turned black with metal (Hg and Pd) 
deposits appearing on the walls of the reaction vessel. 
After standing at room temperature for 18-24 hours, the 
reaction mixture was passed through a 0.45 mm mem- 60 
brane filter (nalgene) to remove most of the remaining 
metal precipitate. The yellow filtrate was diluted five- 
fold and applied to a 100 ml column of DEAE- 
Sephadex TM A-25 (Pharmacia). After washing with 
one column voiume of 0.1 M sodium acetate buffer at 65 
pH 5.0, the products were eiuted using a one liter linear 
gradient (0.1-0.6M) of either sodium acetate at pH 
— 8-9, or triethyiammonium bicarbonate (TEAB) at pH 



N, 7.63; P, 10.31; S, 3.70. Bio-UTP (C 22 H30 N 3 O t9 P3 
Si Na*J H2O): Theoretical; C, 29.15; H. 3.19; N, 7.45; 
P, 9.89; S, 3.41. Found; C, 28.76; H, 3.35; N, 7.68; P, 
9.81; S, 3.32. 

The spectral properties of bio-dUTP and bio- UTP at 
pH 7.5 [k max, 289 nm (* = 7,100); u, max, 240 nm 
(€= 10,700); X min, 262 nm (€ = 4,300)] reflect the pres- 
ence of an exocylic double-bond in conjugation with 
the pyrimidine ring. These nucleotides also give a 
strong positive reaction (an orange -red color) when 
treated with p-dimethylaminocinnamaldehyde in etha- 
noiic sulfuric acid, a procedure used for biotin quantita- 
tion (D. B. McCormick and J. A. Roth, Anal. Biochem.. 
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34, 326, 1970). However, they no longer react with 
ninhydrin, a characteristic reaction of the AA-dUTP 
and AA-UTP starting materials. 

EXAMPLES 3 and 4 5 
Synthesis of biotinyl-CTP and biotinyl-dCTP 

CTP and dCTP were a) mercurated, b) reacted with 
ailylamine, and c) biotinized with NHS-bioun, essen- 
tially as described in Example 1- CTP (56.3 mg, 0.1 
mmole) or dCTP (59.1 mg, 0.1 mmole) were dissolved 10 
in 20 ml of 0.1 M sodium acetate buffer at pH 5.0, and 
mercuric acetate (0.159 gm, 0.5 mmoles) added. The 
solution was heated at 50* C. for 4.5 hours then cooled 
on ice. Lithium chloride (39.2 mg, 0.9 mmoles) was 
added and the solution extracted 6 times with ethyl 13 
acetate. The nucleotide products in the aqueous layer 
were precipitated by the addition of three volumes of 
cold ethanol and the precipitate collected by centrifuga- 
tion. The precipitate was washed with absolute ethanol. 
ethyl ether, and then air dried. These products were 20 
used without further purification for the synthesis of 
AA-CTP and AA-dCTP, respectively. The mercurated 
nucleotides were dissolved in 0.1M sodium acetate 
buffer at pH 5.0 and adjusted to a concentration of 10 
mM (92 OD/ml at 275 nm). 0.6 ml (1.2 mmole) of a 25 
2.0M ailylamine acetate stock (prepared as described in 
Example 1) was added to 10 ml of nucleotide solution 
(0.1 mmole) followed by the addition of K2PdCU (32.6 
mg, 0.1 mmole), dissolved in 1.0 ml of H2O. After stand- 
ing at room temperature for 24 hours, the solution was 30 
filtered through a 0.45 mM membrane to remove metal 
precipates. The filtrate was diluted five -fold and loaded 
onto a 50 ml column of DEAE-sephadex A-25, 
preequilibrated with 50 mM TEAB at pH 7.5. The 
nucleotide products were fractionated by application of 35 
a 500 ml linear gradient (0.05-0.6M) of TEAB at pH 7.5. 
The desired product was in the major UV absorbing 
portion which eluted between 0.28 and 0.38M salt. The 
pooled samples were desalted by rotary evaporation, 
dissolved in 0.5M triethylammonium acetate at pH 4.2, 40 
and final purification achieved by HPLC chromatogra- 
phy on columns of Partisil ODS-2, using 0.5M trie- 
thylammonium acetate as the eluent. Appropriate frac- 
tions were pooled, Iyophilized, and the products dis- 
solved in H2O. The nucleotides were converted to the 45 
Na+ salt by stirring briefly in the presence of Dowex 
TM 50 (Na- form). After filtration, to remove the 
Dowex resin, the nucleotides were precipitated by the 
addition of 3 volumes of cold ethanol. The precipitate 
was washed with ether and then air dried. Analytical 50 
results: AA-dCTP (Cu Hn N 4 On P3 NaUH^O); The- 
ory, C, 22.29; H, 2.63; N, 8.67, P, 14.40. Found C 22.16; 
H. 2.89; N. 8.77; P, 14.18. AA-CTP (C12 Hn N 4 Ou 
NaUH 2 0); Theory C 21.75; H. 2.57; N, 8.4*; P, 14.01. 
Found. C, 22.03; H, 2.47; N, 8.69; P, 13.81; Spectral 55 
properties in 0.1 M Borate buffer at pH 8.0, X max 301 
nm (€=6,400), X min 271 nm (€ = 3,950) X max 250 nm 
(€==9,700). Both AA-dCTP and AA-CTP give a posi- 
tive ninhydrin test. 

AA-CTP (6.6 mg, 0.01 mmole) or AA-dCTP (6.4 mg, 60 
0.01 mmole) was dissolved in 5 ml of 0.1 M sodium 
borate buffer at pH 8.5, and NHS-biotin (3.4 mg, 0.01 
mmole), dissolved in 0.2 ml of dimethylformamide, was 
added. After sitting at room temperature for 4 hours the 
sample was chromaiographed on a 10 ml column of 65 
DEAE-Sephadex A-25, using a 150 ml linear gradient 
(0.1-0.9M) of TEAB at pH 7.5. as eluent. Fractions 
containing biotinyl-CTP or biotinyl-dCTP, which 
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eluted between 0.50 and 0.60M TEAB, were pooled, 
desalted by rotary evaporation, and after being adjusted 
to a final concentration of 5 mM in 0.02M Tris-HCl 
buffer at pH 7.5, were frozen at -20* C. The products 
give a strong positive reaction for biotin with p-dime- 
thyiaminocinnamldehyde in ethanolic sulfuric acid but 
give a negative test for primary amines when sprayed 
with ninhydrin. Further structural characterization of 
these products is in progress. 

EXAMPLES 5 and 6 

Synthesis of Irninobiotinyl-UTP and 
Iminobiotinyl-dUTP 

Iminobiotin hydrobromide was prepared from biotin 
as described previously (K. Hofmann, D. B. Melville 
and V. du Vigneaud, J. Biol. Chem., 141, 207-21 1, 1941; 
K. Hofmann and A. E. Axelrod, Ibid., 187, 29-33, 
1950). The N-hydroxysuccinimide (NHS) ester of 
iminobiotin was prepared using the protocol previously 
described for the synthesis of NHS-Biotin (H. Heitz- 
mann and F. M. Richards, Proc. Nat. Acad. Sci. USA, 
71, 5537, 1974). AA-UTP (7.0 mg, 0.01 mmole) or AA- 
dUTP (6.3 mg, 0.01 mmole), prepared as detailed in 
example 1 (pan b), was dissolved in 5 ml of 0.1 M so- 
dium borate buffer at pH 8.5, and NHS-iminobiotin (3.5 
mg, 0.01 mmole), dissolved in 0.5 ml of dimethylform- 
amide. was added. The reaction mixture was left at 
room temperature for 12 hours and then loaded directly 
onto a 10 ml column of DEAE-Sephadex A-25, 
preequilibrated with 0.05M TEAB at pH 7.5. The col- 
umn was eiuted with a 150 ml linear gradient 
(0.05-O.6M) of TEAB. Fractions containing iminobio- 
tin-UTP or iminobiotin-dUTP, which eluted between 
0.35 and 0.40M TEAB, were desalted by rotary evapo- 
ration in the presence of methanol and dissolved in 
H2O. The products contained a small amount of allyla- 
mine-nucleotide adduct as an impurity, as judged by a 
weak positive result in the ninhydrin test. Final purifica- 
tion was achieved by affinity chromatography on avi- 
dinsepharose. Fractions of the impure product, made 
0. 1M in soldium borate buffer at pH 8.5, were applied to 
a 5 ml column of avidin-sepharose and washed with 25 
ml of the same buffer. The column was then washed 
with 50 mM ammonium acetate buffer at pH 4.0, which 
eluted the desired iminobiotin-nucleotide product in a 
sharp peak. The nucleotide was precipitated by the 
addition of 3 volumes of cold ethanol, washed with 
ethylether, dried in vacuo over sodium hydroxide pel- 
lets and stored in a dessicator at —20* C. Products were 
characterized by elemental analysis, as well as by spec- 
tral and chromographic properties. 

EXAMPLES 7 AND 8 

Synthesis of NAGE-UTP and NAGE-dUTP 

Allyl (3-amino-2-hydroxy-)propyI ether, abbreviated 
NAGE, was prepared from allyl glycidyl ether (Age) 
(obtained from Aldrich Chemical Co.). 10 ml of Age (84 
mmole) was added slowly (in a fume hood) to 50 ml of 
9M ammonium hydroxide and the mixture allowed to 
stand at room temperature for six hours. Excess ammo- 
nia was removed by rotary evaporation under reduced 
pressure to yield a viscous yellow oil. Analysis of this 
product by proton NMR showed that it possessed the 
required structure. 5-mercuri-dUTP (0.1 mmole) or 
5-mercuri-UTP (0.2 mmole) was dissolved in 2—* ml of 
0.2M sodium acetate buffer at pH 5.0, and a 16 fold 
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molar excess of NAGE adjusted to pH 5.0 with acetic 
acid prior to use, was added. The flnaJ reaction volumes 
(4.3 and 8.4 ml) had nucleotide concentrations of 43 and 
42 mM, respectively. One equivalent of K2PdCU(0.1 or 
0.2 mmoles) was added to initiate the reaction. After 5 
standing at room temperature for 18 hours, the reaction 
mixtures were filtered through 0.45 u.mM membranes 
the samples diluted five- fold, and chromato graphed on 
columns of DEAE-Sephadex A-25, using linear gradi- 
ents (0. 1-0. 6M) of sodium acetate. Fractions containing \q 
the desired products, as judged by their UV spectra and 
characteristic HPLC eiution profiles on Partisil ODS-2, 
were pooled, diluted, and further purified by re- 
chromatography on DEAE-Sephadex using shallow 
gradients (0.1-0.5M) of ammonium bicarbonate at pH 15 
8.5. Under these conditions the majority of the NAGE- 
dUTP (or NAGE-UTP) could be cleanly separated 
from residual impurities. Proton NMR spectra were 
obtained at this stage of purification after the nucleo- 
tides were Iyophilized and redissolved in D2O. For ele- 20 
mental analysis, the products were converted to their 
sodium salt form. Typical analytical results: Nage- 
dUTP (C15 H22 N 3 Oi6 P3 Nat. 2 H 2 0), Theory, C 
24.99; H, 3.63; N, 5.83; P, 12.88. Found, C, 25.39; H, 
3.71; N, 5.63; P, 12.88 25 

EXAMPLE 9 
Uses of Labeled DNA Sequences 
I. Karyotyping 

(a) Select from a human gene library some 100 to 200 30 
clones. Label them as described above, and for each 
clone locate its place or places of hybridization visually 

or with a low-light- level video system. For those clones 
which correspond to a unique sequence gene this deter- 
mines the location of the cloned DNA on a particular 35 
human chromosome. Obtain several clones for each 
chromosome. Each of these labeled clones can be used 
to identify particular chromosomes. They can also be 
used in combination to identify each of the 46 chromo- 
somes as being one of the 22 autosomal pairs or the X or 40 
the Y. By allowing one set of labeled clones to hybridize 
to the chromosomes and then adding a fluorescent stain 
to the label, the set of clones and their locations can be 
visualized and will flouresce with a particular color. A 
second set of labeled clones could then be used and 45 
reacted with a second fluorescent dye. The same pro- 
cess can be repeated a number of times. Thus one can. if 
desired, have several sets of fluorescent labels attached 
to the cellular DNA at different but specific locations 
on each of the chromosomes. These labels could be used 50 
for visual or computerized automatic karyotyping. 

(b) For automatic karyotyping, one could use one set 
of clones to identify the approximate location of each of 
the 46 chromosomes by finding sets of spots corre- 
sponding to the number of labeling sites on each chro- 55 
mosome. Thus, it is possible by computer analysis of the 
digitized images to determine if the chromosomes are 
suitably spread for further analysis. If they are suitably 
spread, then one can use computer analysis to identify 
each of the individual chromosomes by the location and 60 
distribution of the labelled spots on each one. 

By using the fact that the fluorescent spots can be 
placed at specific locations on each chromosome, one 
can carry out either manual or automatic karyotyping 
very much more effectively than without such labels. 65 

II. Diagnosis of Genetic Disorders 

By selecting the clones which bind specifically to a 
particular chromosome, such as number 23, it is possible 



to count the number of copies of the particular chromo- 
some in a cell even if the chromosomes, are not con- 
densed at metaphase. Thus when fetal cells are obtained 
for prenatal diagnosis of trisomy 21, the diagnosis can 
be done even if the chromosomes are not condensed at 
metaphase. If necessary, two sets of labels can be 
used—one which would be specific for chromosome 23 
and one for some other chromosome. By measuring in 
each cell the ratio of the two labels, which might be of 
different colors, it is possible to identify the cells which 
show an abnormal number of chromosomes number 23. 
This procedure could be used either on slides with a 
low-light-level video system or in a flow cytometer 
system using laser excitation. It can be used to deter- 
mine any abnormal chromosome number. 

III. Microorganism Detection and Identification 
The labeling of specific sequences of DNA as de- 
scribed above permits identification and counting of 
individual bacteria. In order to identify the individual 
bacteria to which a particular fragment of DNA hybrid- 
izes the sensitivity must be such that a single labelled 
structure can be detected. This can be done using a 
low-light-level video system and computer summation 
of images, or by using some other device for intensify- 
ing the light image. A flow system can also be used if 
the sensitivity can be made sufficiently grand. If one 
immobilized the bacteria on a slide their location could 
be found and the number of such fluorescent spots 
counted. This would provide a count of all of those 
bacteria which contain DNA which can hybridize with 
the specific clone utilized. If the clone is selected as 
being specific for a particular strain or bacteria, then 
one can count the number of organisms of that strain. In 
addition, any antibiotic resistance for which a particular 
gene has been identified could be characterized in a 
similar way using, as a probe, the DNA sequence which 
is contained in the antibiotic resistance gene. In addi- 
tion, a probe could be used which is specific for a resis- 
tance plasmid containing one or more antibiotic resis- 
tance genes. In addition to individual bacteria, groups of 
bacterial cells of a particular strain can be detected and 
their number estimated if they are located in a small spot 
so that the total fluorescence specific to the hybridized 
DNA in the spot can be measured. In this way the 
number of organisms containing a specific DNA se- 
quence can be measured in a mixture of bacteria. 
We claim: 

1. A method of detecting the presence or absence of 
a nucleic acid in a sample which comprises the steps of 
(a) contacting under hybridizable conditions said 

sample with at least one compound comprising the 

structure: 
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10 

wherein each of B' and B" represents a purine, 7- 
deazapurine, or pyrimidine moiety covalentiy 
bonded to the C l '-position of the sugar moiety, 
provided that whenever B' or B" is purine or 7- 15 
deazap urine, the sugar moiety is attached at the 
N'-position of the purine or 7-deazapurine, and 
whenever B' or B" is pyrimidine the sugar moiety 
is attached at the N 1 -position of the pyrimidine; 

wherein B represents 7-deazapurine or pyrimidine 20 
moiety covalentiy bonded to the C 1 '-position of the 
sugar moiety, provided that whenever B is 7- 
deazap urine, the sugar moiety is attached at the 
N 9 -position of the 7-deazapurine, and whenever B 
is pyrimidine the sugar moiety is attached at the 25 
N l -position of the pyrimidine; 

wherein A comprises at least three carbon atoms and 
represents at least one component of a signalling 
moiety capable of producing a detectable signal; 

wherein B and A are covalentiy attached directly or 30 
indirectly through a linkage group, said linkage 
group not interfering substantially with the charac- 
teristic ability of said compound to hybridize with 
said nucleic acid or of A to be detected; 

wherein if B is 7-deazapurine, A is attached to the 35 
7-position thereof, and if B is pyrimidine, A is at- 
tached to the 5-position thereof; 

wherein m, n and p are integers, provided that m and 
p are not simultaneously 0 and provided further n is 
never 0; and 40 

wherein z represents H — or HO — ; and 

(b) detecting said compound or compounds so as to 
detect said nucleic acid. 

2. The method of claim 1 wherein the moiety A is a 
ligand. 45 

3. The method of claim 2 wherein the ligand is se- 
lected from the group consisting of a hapten, an antigen, 
a cofactor, biotin and irninobiotin. 

4. The method of claim 2 wherein the ligand is se- 
lected from the group consisting of dinitrophenol, lipoic 50 
acid and an olefinic compound. 

5. The method of claim 2 wherein the ligand is capa- 
ble of forming a complex by binding with a detectable 
polypeptide. 

6. The method of claim 5 wherein the detectable 55 
polypeptide is selected from the group consisting of an 
antibody, an enzyme capable of depositing insoluble 
reaction products, strep tavidin and avidin. 

7. The method of claim 5 wherein the sample is con- 
tacted with the detectable polypeptide after hybridiza- 60 
tion of the compound or compounds to said nucleic acid 
under suitable conditions as to form the complex. 

8. The method of claim 5 wherein an indicator mole- 
cule is associated with or bound to the detectable poly- 
peptide. 65 

9. The method of claim 8 wherein the indicator mole- 
cule is fluorescent, electron dense, or an enzyme capa- 
ble of depositing insoluble reaction products. 



10. The method of claim 9 wherein the enzyme is 
selected from the group consisting of alkaline phospha- 
tase, peroxidase and 6-galactosidase. 

11. The method of claim 9 wherein the fluorescent 
indicator molecular is selected from the group consist- 
ing of fluorescein and rhodamine. 

12. The method of claim 9 wherein the electron dense 
indicator molecule is selected from the group consisting 
of ferritin, hetnocyanin and colloidal gold. 

13. The method of claim 8 wherein the indicator 
molecule is covalentiy linked to the detectable polypep- 
tide. 

14. The method of claim 8 wherein the detectable 
polypeptide is indirectly detectable by specifically com- 
plexing the detectable polypeptide with a second poly- 
peptide covalendy linked to an indicator molecule. 

15. The method of claim 14 wherein said detectable 
polypeptide is selected from the group consisting of 
avidin and strep tavidin and the second polypeptide is 
selected from the group consisting of biotin and 
irninobiotin. 

16. The method of claim 5 wherein at least one of said 
compounds is labeled with a first indicator molecule 
and at least one other of said compounds is labeled with 
a second indicator molecule. 

17. The method of claim 16 wherein the compound 
labeled with the first indicator molecule is allowed to 
hybridize to the nucleic acid and is detected and then 
the compound labeled with the second indicator mole- 
cule is allowed to hybridize to the nucleic acid and is 
detected. 

18. The method of claim 1 wherein the moiety A 
comprises an indicator molecule. 

19. The method of claim 18 wherein said indicator 
molecule is fluorescent, electron dense, or is an enzyme 
capable of depositing insoluble reaction products. 

20. The method of claim 19 wherein the enzyme is 
selected from the group consisting of alkaline phospha- 
tase, peroxidase and £-gaiactosidase. 

21. The method of claim 19 wherein the fluorescent 
indicator molecule is selected from the group consisting 
of fluorescein and rhodamine. 

22. The method of claim 19 wherein the electron 
dense compound is selected from the group consisting 
of ferritin, hemocyanin and colloidal gold. 

23. A method of claim 1 wherein said signalling moi- 
ety is capable of producing a detectable signal when the 
compound is incorporated into a double-stranded ribo- 
nucleic acid, deoxyribonucleic acid duplex or DNA- 
RNA hybrid. 

24. The method of claim 1 wherein said detecting step 
(b) is carried out when the compound is hybridized to 
the nucleic acid. 

25. The method of claim 1 wherein said nucleic acid 
is immobilized on a solid support. 

26. The method of claim 1 wherein the moiety A 
comprises at least 5 carbon atoms. 

27. The method of claim 1 wherein the moiety A is 
non-aromatic. 

28. The method of claim 1 wherein B is selected from 
the group consisting of uracil, cytosine, deazaadenine, 
deazaguanine. 

29. The method of claim 1 wherein the linkage group 
comprises an olefinic bond at the a-position relative to 
B. 

30. The method of claim 29 wherein the linkage 
group comprises the moiety — CH=CH — CH- 
2 _NH— . 



33 



5,328,824 



34 



31. The method of claim 29 wherein the linkage 
group comprises the moiety 



— CH=CH— CH 2 — O— CHt— CH— CH 2 — NH— . 

OH 



32. The method of claim 1 wherein said nucleic acid 
is derived from a living organism. 

33. The method of claim 32 wherein said living 10 
organism is selected from the group consisting of 
prolcaryotes and eukaryotes. 

34. The method of claim 1 wherein said sample is 
suspected of containing an etiological agent and said 



nucleic acid is associated with said etiological agent. 

35. The method of claim 34 wherein said sample is of 
human or animal origin and said etiological agent is 
selected from the group consisting of bacteria, viruses 
and fungi. 

36. The method of claim 1 wherein said sample 
comprises a microorganism suspected of containing a 
nucleic acid which imparts resistance to an antibiotic 
and wherein said compound comprises a polynucleo- 
tide complementary to the nucleic acid of said microor- 
ganism which confers resistance to said antibiotic. 

37. The method of claim 36 wherein when said 
micoorganisra is Streptococcus pyrogenes or Neisseria 
meningitidis, said antibiotic is penicillin, wherein when 
said microorganism is Staphylococcus aureus, Candida 
albicans, Pseudomonas aeruginosa, Streptococcus pyr- 



15 



20 



25 
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ogenes, or Neisseria gonorrhoeae, said antibiotic is a 
tetracycline, and wherein when said microoganism is 
Mycobacterium tuberculosis, said antibiotic is an amin- 
oglycoside. 

33. The method of claim 1 wherein said sample is 
suspected of containing a nucleic acid associated with a 
genetic disorder and wherein said compound comprises 
a polynucleotide complementary to the nucleic acid 
associated with said genetic disorder. 

39. The method of claim 1 wherein said sample is 
suspected of containing a nucleic acid associated with 
or absent in thalassemia and wherein said compound 
comprises a polynucleotide complementary to the nu- 
cleic acid which is associated with or absent in thalasse- 
mic subjects. 

40. The method of claim 1 for chromosomal karyo- 
typing which comprises contacting said sample with a 
series of said compounds which are complementary to 
a series of known genetic nucleic acids located on 
chromosomes. 

41. The method of claim 1 wherein said sample is 
suspected of containing a nucleic acid which codes for 
expression of a polypeptide associated with a tumor 
cell and wherein said compound comprises a polynu- 
cleotide complementary to the messenger ribonucleic 
acid transcribed from a deoxyribonucleic acid associat- 
ed with the production of said polypeptide. 

42. The method of claim 41 wherein said polypeptide 
is a-fetal protein. 
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